Bone conduction transducers for privacy

ABSTRACT

A method for routing audio content through an electronic device that is to be worn by a user. The method obtains a communication and determines whether the communication is private. In response to determining that the communication is private, the method drives a bone conduction transducer of the electronic device with an audio signal associated with the communication. In response to determining that the communication is not private, however, the method drives a speaker of the electronic device with the audio signal.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of pending U.S. application Ser. No.16/890,843 filed Jun. 2, 2020, which claims the benefit of and priorityto U.S. Provisional Patent Application No. 62/865,092, filed Jun. 21,2019, which are hereby incorporated by this reference in their entirety.

FIELD

An aspect of the disclosure relates a computer system for routing audiocontent that is deemed to be private through a bone conductiontransducer.

BACKGROUND

Headphones is an audio device that includes a pair of speakers, each ofwhich is placed on top of a user's ear when the headphones are worn onor around the user's head. Similar to headphones, earphones (or in-earheadphones) are two separate audio devices, each having a speaker thatis inserted into the user's ear. Headphones and earphones are normallywired to a separate playback device, such as a digital audio player,that drives each of the speakers of the devices with an audio signal inorder to produce sound (e.g., music). Headphones and earphones provide aconvenient method by which the user can individually listen to audiocontent without having to broadcast the audio content to others who arenearby. Sound produced by headphones, however, may leak into the ambientenvironment, broadcasting the audio content to others who are nearby.

SUMMARY

An aspect of the disclosure is a method performed by a computer systemfor determining how to route audio content of a communication through anelectronic device, such as a pair of wireless open-back headphones thatis to be worn by a user. The electronic device may include two or morechannels, each of which route audio content differently. For instance,one channel may be a “private” channel, through which audio content isrouted to a bone conduction transducer that outputs the audio asmechanical vibrations of the user's skull. Such vibrations may onlyheard by the user of the electronic device. In contrast, the electronicdevice may include another “non-private” channel, through which audiocontent is routed to at least one of the speakers of the open-backheadphones that outputs sound into (or towards) an ear of a user. Soundproduced by the speaker may be heard by not only the user of theelectronic device, but also others that are within a close proximity(e.g., persons sitting next to the user of the electronic device), sincesound produced by speakers of open-back headphones may leak out into theenvironment. The computer system obtains a communication, which may be atext message or an incoming telephone call, via a network. The computersystem determines whether the communication is private. For example, thecommunication may explicitly indicate that its content is to be private(e.g., the sender of the communication tags the communication as beingprivate). As another example, the computer system may determine that thecommunication is private based on an analysis of its audio content. Forinstance, the communication may contain words or phrases that areindicative of a private communication. In response to determining thatthe communication is private, the communication's audio content isoutputted through the bone conduction transducer. If, however, thecommunication is determined to not be private, its audio content isinstead outputted through the speakers.

The above summary does not include an exhaustive list of all aspects ofthe present disclosure. It is contemplated that the disclosure includesall systems and methods that can be practiced from all suitablecombinations of the various aspects summarized above, as well as thosedisclosed in the Detailed Description below and particularly pointed outin the claims filed with the application. Such combinations haveparticular advantages not specifically recited in the above summary.

BRIEF DESCRIPTION OF THE DRAWINGS

The aspects of the disclosure are illustrated by way of example and notby way of limitation in the figures of the accompanying drawings inwhich like references indicate similar elements. It should be noted thatreferences to “an” or “one” aspect of the disclosure are not necessarilyto the same aspect, and they mean at least one. Also, in the interest ofconciseness and reducing the total number of figures, a given figure maybe used to illustrate the features of more than one aspect of thedisclosure, and not all elements in the figure may be required for agiven aspect.

FIG. 1 shows a progression of stages of a headset receiving an incomingcall that is routed to a bone conduction transducer.

FIG. 2 shows a block diagram of the headset.

FIG. 3 is a flowchart of one aspect of a process to route acommunication to either a bone conduction transducer or a speaker of theheadset.

FIG. 4 is a flowchart of one aspect of a process to determine whether acommunication is private or non-private.

FIG. 5 is a flowchart of one aspect of a process to route an audiosignal to either a bone conduction transducer or a speaker.

DETAILED DESCRIPTION

Several aspects of the disclosure with reference to the appendeddrawings are now explained. Whenever the shapes, relative positions, andother aspects of the parts described in the aspects are not explicitlydefined, the scope of the disclosure is not limited only to the partsshown, which are meant merely for the purpose of illustration. Also,while numerous details are set forth, it is understood that some aspectsof the disclosure may be practiced without these details. In otherinstances, well-known circuits, structures, and techniques have not beenshown in detail so as not to obscure the understanding of thisdescription.

FIG. 1 illustrates a user 100 wearing a headset 115 that routes acommunication 140 to a bone conduction transducer (hereafter referred toas “BCT”) 135, in response to determining that the communication isprivate. Specifically, this figure illustrates two stages 105 and 110 inwhich the headset 115 obtains the communication, determines that thecommunication 140 is private, and then drives the BCT 135 with an audiosignal associated with the communication in order to output thecommunication, as for example vibrations.

The headset 115 illustrated in this figure is a pair of over-the-earopen-back headphones that allows some sound produced by at least onespeaker of the headphones to leak into the environment via a speakergrill 130 and/or allows some ambient sounds from the environment to leakinto the earcup 118 via the speaker grill 130 be heard by the user. Inone aspect, the headset 115 may be any electronic device that includesat least one speaker and at least one BCT, and is capable of being wornby a user and capable of performing one or more audio processingoperations. For instance, the headset 115 may be a pair of in-ear,on-ear, or over-the-ear headphones (open-back, closed-back, or semi-openheadphones). In one aspect, the headset 115 is at least one earphone(e.g., earbud) that is configured to be inserted into an ear canal ofthe user. In one aspect, the headset 115 may also be any electronicdevice that is capable of performing networking operations. Forinstance, the headset 115 may be a wireless electronic device that isconfigured to establish a wireless connection with a media playbackdevice (not shown), such as a smart phone, a tablet, a laptop, etc.,over a wireless computer network, using e.g., BLUETOOTH protocol or awireless area network. During the established wireless connection, theheadset 115 may exchange (e.g., transmit and receive) data packets(e.g., Internet Protocol (IP) packets) with the media playback device.In one aspect, this wireless connection is pairing the headset 115 withthe media playback device in order to allow the headset 115 to performat least some operations that may otherwise be performed by the mediaplayback device. For example, as described herein, the user 100 mayparticipate in a handsfree phone call that is initiated by the mediaplayback device, but conducted through the headset 115. In one aspect,the electronic device may be a wearable device, such as smart glasses ora smart watch.

As used herein, a “communication” may refer to an electronic message orsignal (e.g., audio signal) that is obtained or exchanged, between theheadset 115 and another electronic device. Examples of an electronicmessage may include an instant message, a personal message, a textmessage (e.g., short message server (SMS), multimedia message (MMS),etc.), an electronic mail, and a voice mail. In one aspect, theelectronic message may include text, image data (which may includedigital images and/or video), and/or an audio signal (e.g., as audiodata). Another type of communication may include a telephone (e.g.,voice) call in which an originating electronic device calls the headset115 over a public switched telephone network (PSTN) and/or to avoice/data network (e.g., that supports voice-over-IP). In one aspect,the voice-data network may couple to the headset 115 over any type ofcomputer network (e.g., local area networks, wireless area networks,etc.). In this case, the communication may be associated with an audiosignal, such as a downlink signal during the phone call.

In one aspect, the communication may be obtained from a computer program(or application) that is being executed by at least one processor in theheadset 115. In another aspect, the communication may be obtained froman application executing on another electronic (e.g., media playback)device that is paired with the headset 115. Specifically, thecommunication may be an indication or notification produced by anapplication that output of an audio signal is desired. For example, thecommunication may be produced by a messaging application to alert theuser of a SMS message has been recently received. Such a communicationmay be associated with an audio signal containing speech of “A messagehas been received from Hal.” As another example, the communication maybe produced by a telephony application when a telephone call is beingreceived. Specifically, the telephony application may produce thecommunication upon receiving a control signal from a voice network thata telephone call is to be received. Such a communication may beassociated with an audio signal containing a ring tone and informationregarding the call (e.g., telephone number). In one aspect, thecommunication may be an indication of an audible reminder that is to beoutputted at a certain time by a calendar application. In one aspect,the communication is produced by a media playback application thatindicates audio output, such as music, is desired.

In another aspect, the communication may be a virtual personal assistant(“VPA”) response or alert from a VPA that is executing on the headset115 and/or executing on another electronic device that is paired withthe headset 115 (e.g., smart phone). For example, the response or alertmay be issued by the VPA in response to a question from the user. In oneexample, the user may trigger the VPA by saying a trigger phrase (e.g.,“Hey Hal”) and then may ask the VPA a question (or issue a command),such as “what time is it?” In response, the VPA will generate (orproduce) an audible response (e.g., “It is five o'clock.”) to beincluded within (or associated with) the VPA response or alert. Theaudible response may be an audio signal that is rendered and outputted(or played back) by the headset 115.

In another aspect, and as described herein, the communication may be anaudio signal. Specifically, rather than being an indication ornotification that an audio signal is to be outputted, the communicationmay be the audio signal. Continuing with the previous examples, theaudio signal may be the audio signal of the audible response from theVPA. In one aspect, the audio signal may include user-desired audiocontent, such as a musical composition.

As used herein, a communication is “private” when the user 100 of theheadset 115 is the only person that is meant to view and/or hear thecontents of the communication. An example of such a communication may bean electronic message from a family member or a doctor. As describedherein, a communication may be determined to be private based on severalfactors, such as the content contained therein. Conversely, acommunication is “non-private” when may be heard and/or viewed by othersbesides the user. An example of such a communication may be a virtualassistant alert relating to navigation instructions.

The headset 115 also includes a right earcup 118 that has a microphone120, a BCT 135, and a speaker grill 130, behind which is at least onespeaker (e.g., speaker 220 of FIG. 2). Specifically, the microphone 120is configured to sense (or capture) sound (e.g., speech of the user 100)that is within the environment, and convert the sound into an audiosignal (or microphone signal). The BCT 135 is positioned on a portion ofthe earcup 118 that rests upon (or abuts) at least a portion of theuser's head. For instance, the BCT 135 is positioned inside a cushion ofthe earcup. The BCT 135 is configured to output an audio signal asmechanical vibrations of the user's head. In this figure, the BCT 135abuts the mastoid process that is a portion of the temporal bone of aright side of the user's skull. The BCT 135 conducts sound to the innerear of the user through vibrations that are picked up by the mastoidprocess of the skull. In one aspect, the left earcup (not shown) of theheadset 115 may include the same number of elements described herein atsimilar locations (e.g., another BCT that abuts the mastoid process on aleft side of the user's head).

In another aspect, the number of elements of the headset 115 may bedifferent and/or may be positioned at different locations about theheadset 115. For instance, the BCT 135 may be positioned differently onthe headset 115 in order to come into contact with a different part ofthe user's head. As one example, the BCT may come into contact with theuser's temporal bone, temporomandibular joint, mandible, etc.

As described herein, an audio signal associated with a communication maybe outputted by the speakers (e.g., 220) and/or the BCT 135. In oneaspect, however, the headset 115 may route an audio signal to (one ormore of) the speakers in the open-back earcup by default, rather than tothe BCT 135. Specifically, under most conditions, the headset 115 mayroute an audio signal to speaker 220. The default towards this speakermay be due to several factors. One factor may be that air conductionprovides higher quality sound to a user than bone conduction. Forexample, bone conduction has a narrower frequency range than airconduction, which is due to bypassing most of the acoustic auditoryprocesses and directly vibrating the user's cochlea. Furthermore, unlikebone conduction, air conduction may preserve spatial audio cues throughthe use of spatial filters, such as head related transfer functions(HRTFs) that account for diffraction and reflections of sound on partsof a user's body (e.g., head, pinnae, etc.). Thus, users in general mayprefer air conduction since sound has a higher fidelity than boneconduction. In addition to reduced quality, bone conduction devices maybe uncomfortable to wear for some users due to required head contact.

Although providing better sound quality than bone conduction, airconduction has drawbacks, such as “sound leakage,” which is sound thatis inadvertently heard by others within close proximity to the user 100(e.g., a person sitting next to the user 100 at a restaurant). Forinstance, in the case of open-back headphones, the sound leakage may besound that is audible (e.g., having a sound pressure level above 0dBSPL), up to a threshold distance away from the user (or the user'sear). Sound leakage may also be compounded based on the volume level ofthe sound produced by the speaker in the earcup 118. Sound leakage forbone conduction, on the other hand, is a less significant problem. Forinstance, if audio content were to be played back through the BCT 135and speaker 220 of the earcup 118 at similar playback levels, thespeaker 220 would have more sound leakage than the BCT. In one aspect,sound leakage due to bone conduction may be virtually non-existent up toa volume level. Thus, even if there is sound leakage, the amount is farless than with air conduction and may not be perceived by others due todiffusion in the air.

The user 100 may want to avoid (substantial) sound leakage from airconduction under certain conditions. For example, this sound may be verydistracting to others who are nearby, especially if the environment haslittle ambient sound, such as a library. Besides being distracting toothers, the user 100 may want to avoid routing audio signals to thespeaker of the earcup 118 when audio content of the audio signal is of aprivate nature (e.g., a phone call from doctor) in order to preventothers from eavesdropping, due to the sound leakage via the speakergrill 130. To prevent others from hearing this audio content, theheadset 115 may route audio signals that are private to the BCT 135 inorder to output the audio as vibrations that are only heard by the user100. Those who are within a close proximity of the user 100, however,will be unable to (or are less likely to) decipher the audio outputted(or produced) by the BCT 135. Thus, the headset 115 is configured toroute audio content that is determined to be private to the BCT 135, androute audio content that is determined to be non-private to the speaker220 (and/or a speaker of a left earcup of the headset 115). More abouthow the headset 115 makes this determination is described herein.

Returning to FIG. 1, stage 105 illustrates the headset 115 playing backaudio content (e.g., music) 125 through the speaker 220, and as aresult, at least some of the music 125 is leaking out into theenvironment through the grill 130. In one aspect, the headset 115 mayroute audio signals containing the audio content to the speaker of theearcup 118 based on a determination that the content (or audio signal)is not private. This stage further illustrates that while the music 125is playing back, the headset 115 receives (obtains) an incoming phonecall 140 from Dr. Smith. Specifically, a telephony application executingon the headset 115 may be made aware of or may identify an incomingphone call by the operating system, OS, of the headset 115. For example,the OS may alert the telephony application of a telephony change at theheadset 115 (e.g., receipt of a control signal from a cell tower thatindicates an incoming call). In addition to identifying the incomingcall, the telephony application may identify information regarding theincoming call, such as a telephone number of the originating device(e.g., through a caller-ID mechanism). From this information, theheadset 115 may determine whether the incoming phone call is a privatephone call in order to determine how to properly route the call. Forinstance, the headset 115 may compare the telephone number of theincoming phone call to entries of a contacts list stored within theheadset 115, where each entry of the contacts list associates a phonenumber with a known person (e.g., a person whom the phone numberbelongs) to identify a matching entry. In one aspect, each entry mayinclude additional user defined settings. For instance, the user of theheadset 115 may designate that communications received (e.g., phonecalls, electronic messages, etc.) from a phone number of an entry aredeemed private, and are to be routed to the BCT 135. More about how theheadset 115 determines how to route a communication is described herein.

Stage 110 illustrates that the phone call has been determined to beprivate (e.g., based on the user settings of a corresponding entrywithin the contacts list), and, in response, the headset 115 is routingthe incoming call to the BCT 135. In one aspect, upon determining thatthe phone call is private, the headset 115 may present an indication orringing alert to the user 100, indicating that there is an incoming call140. For instance, the headset 115 may output a ring tone and/or maypresent a graphical user interface (GUI) on a display of the headset 115(not shown). Upon the user 100 accepting the call (e.g., pressing abutton or through a verbal command to answer the call), the headset 115may obtain a downlink audio signal of the incoming call, and use thesignal to drive the BCT 135 to produce vibrations. In one aspect, upondetermining that the call is a private call, the headset 115 may routethe ringing alert to the BCT 135, rather than (or in addition to)routing the alert to the speaker of the earcup 118. In one aspect, theheadset 115 may automatically (e.g., without user interference) answerthe incoming call. Thus, the headset 115 drives the BCT 135 with thedownlink signal that contains speech 145 of Dr. Smith.

This stage also illustrates that upon routing the downlink audio signal,the headset 115 has ceased to output the music 125. For instance, inthis case, the headset 115 may mute the music 125 to prevent the speakerof the earcup 118 from outputting all of the sound. In one aspect,rather than muting the music 125, the headset 115 may continue to outputthe music 125, while the user 100 engages in the phone call with Dr.Smith. In another aspect, the headset 115 may adjust sound output of theearcup's speaker(s), such as reducing sound output volume and/orreducing a portion of the spectral content of the music (e.g., applyinga high-pass or band-pass filter). Once the user 100 terminates the phonecall, the BCT 135 may cease to output the downlink audio signal, and theheadset may resume playing back the music 125 through the speaker(s) ofthe earcup 118.

FIG. 2 shows a block diagram of the headset 115 according to one aspectof the disclosure. The headset 115 includes a camera 205, a microphonearray 210 of two or more microphones 120, a BCT 135, a left earcup 220with at least one speaker 225, the right earcup 118 with at least onespeaker 230, a controller 215, at least one display screen 235, and anetwork interface 240. In one aspect, the headset 115 may include fewerelements, such as not having the display 235 and/or may include moreelements, as described herein. In some aspects, at least some of theseelements are integrated into a housing of the headset 115. In oneaspect, the headset 115 may be a wireless device, as previouslydescribed. In one aspect, the network interface is configured toestablish a wireless communication link with a wireless access point inorder to exchange data with an electronic server over a wireless network(e.g., the Internet). In another aspect, the network interface 240 isconfigured to establish a wireless communication link (e.g., pair) withthe other electronic device in order to exchange data.

In one aspect, the headset 115 may be a part of a computer system, asdisclosed herein, that includes a separate media playback device, suchas a smart phone, which is paired with the headset 115 by establishing a(e.g., wired and/or wireless) connection. In one aspect, the (e.g.,programmed processor of the) media playback device may perform one ormore of the operations described herein, such as audio/image processingoperations. In another aspect, at least some of the elements of theheadset 115 described herein (e.g., the BCT 135, the display 240, etc.)may be a part of different electronic devices within the computersystem. In this case, the different electronic devices may communicatewith the headset 115 and/or each other via a wireless or wiredcommunication link. For example, rather than the headset 115 having twoearcups 118 and 225, the headset may be wirelessly coupled (e.g.,paired) with separate audio receiver devices that each includes at leastone speaker (e.g., wireless earphones or earbuds). In this example, theheadset 115 may exchange audio data with both these earphones in orderfor the speakers of the earphones to output sound.

In one aspect, the camera 205 is a complementarymetal-oxide-semiconductor (CMOS) image sensor that is capable ofcapturing digital images as image data that represent a field of view ofthe camera 205, where the field of view includes a scene of anenvironment in which the headset 115 is located. In some aspects, thecamera 205 may be a charged-coupled device (CCD) camera type. The camera205 is configured to capture still digital images and/or video that isrepresented by a series of digital images. In one aspect, the camera isin a frontal position of the headset 115 with respect to a user. Inanother aspect, the camera may be positioned differently and/or theremay be more than one camera. The BCT 135, as described herein, may beany type of electronic component that is configured to output an audiosignal as mechanical vibrations. In one aspect, the headset 115 mayinclude a driver amplifier (not shown) coupled to the BCT 135 that canreceive an analog input from a respective digital to analog converter,where the later receives its input digital audio signal from thecontroller 215.

Each microphone 120 of the microphone array 210 may be any type ofmicrophone (e.g., a differential pressure gradientmicro-electromechanical system (MEMS) microphone) that is configured toconvert acoustic energy caused by sound waves propagating in an acousticenvironment into a microphone signal. Each speaker 225 and 230 of theearcups 220 and 118, respectively, may be an electrodynamic driver thatmay be specifically designed for sound output at certain frequencybands, such as a subwoofer, tweeter, or midrange driver, for example. Inone aspect, the speakers may be a “full-range” (or “full-band”)electrodynamic driver that reproduces as much of an audible frequencyrange as possible. Each speaker “outputs” or “plays back” audio byconverting an analog or digital speaker driver signal into sound. In oneaspect, the headset (or the earcups) includes a driver amplifier (notshown) for each speaker that can receive an analog input from arespective digital to analog converter, where the later receives itsinput digital audio signal from the controller 215.

The display screen 235, as described herein, is configured to displayimage and/or video data to the user of the headset 115. In one aspect,there may be one display. In another aspect, there may be at least onedisplay for each eye of the user.

The controller 215 may be a special-purpose processor such as anapplication-specific integrated circuit (ASIC), a general purposemicroprocessor, a field-programmable gate array (FPGA), a digital signalcontroller, or a set of hardware logic structures (e.g., filters,arithmetic logic units, and dedicated state machines). The controller215 is configured to determine whether a communication is private ornon-private, and based on the determination to route an audio signalassociated with the communication to an appropriate output (e.g., theBCT 135 or at least one of the speakers 225 and 230). More about how thecontroller makes this determination is described herein.

In one aspect, the controller 215 is configured to perform imageprocessing operations upon digital images captured by the camera 205.Specifically, the controller 215 is configured to perform objectrecognition (e.g., through the use of an object recognition algorithm)upon the digital images (image data) in order to detect objects that arecaptured within the field of view of the camera 205. For example, thecontroller 215 may process the digital images to identify patternstherein (e.g., structural patterns) and compare them to previouslystored patterns therein. Once a matching pattern is found, thecontroller 215 is said to detect the object within the digital image. Inone aspect, the previously stored patterns do not need to match exactlyto the identified patterns within the digital images. For instance, thepatterns may match within a tolerance (e.g., 5%, 10%, 15%, etc.).

In another aspect, the controller 215 is further configured to receivemicrophone signals from at least some of the microphones 120 of themicrophone array 210, to perform audio processing operations thereon.Specifically, the controller 215 may include a sound pickup beamformerthat can be configured to process the microphone signals to formdirectional beam patterns for spatially selective sound pickup incertain directions, so as to be more sensitive to one or more soundsource locations. For example, referring to FIG. 1, a microphone arraymay direct a beam pattern towards the user's mouth in order capture theuser's speech, while minimizing undesired sounds and noises within theenvironment.

In one aspect, the controller is further configured to process inputaudio for outputting through one or more of the headset's speakers. Forexample, the controller 215 is to receive an input audio signal(channel) of a piece of sound program content to be rendered by thecontroller and drive at least one speaker with the input audio signal tooutput audio content contained within the signal, such as the music 125being played back in FIG. 1. In one aspect, the controller may receivethe input audio signal from a programmed processor that is running amedia player application program. Alternatively, there may be more thanone input audio channel, such as a two-channel input, namely left andright channels of a stereophonic recording of a musical work, or theremay be more than two input audio channels, such as for example theentire audio sound track in 5.1-surround format. In one aspect, whenthere are multiple input audio channels, they may be downmixed toproduce a signal downmixed audio channel. In one aspect, the controller215 is configured to spatially render an input audio signal by applyingspatial filters (e.g., head-related transfer functions (HRTFs)) toproduce at least one binaural signal for each earcup. In one aspect, theHRTFs may be personalized for the user of the headset 115 in order toaccount for the user's anthropometrics.

In one aspect, the controller 215 is configured to perform speechrecognition (e.g., through the use of a speech recognition algorithm) toanalyze an audio signal to find (or recognize) speech therein.Specifically, the controller may analyze audio data of the audio signalaccording to the speech recognition algorithm to identify a word orphrase contained therein. In one aspect, the controller may performspeech recognition upon text to identify a word or phrase. In anotheraspect, the controller 215 is further configured to perform atext-to-speech (TTS) algorithm to produce a synthesized speech versionof text. Specifically, the TTS algorithm produces an audio signal thatincludes speech of the text, which as described herein may be obtainedin a communication, such as an electronic message. Alternatively, theaudio signal containing the speech may be retrieved as previouslyrecorded human speech from a remote electronic server, via a computernetwork.

In some aspects, the headset 115 may include additional elements. Forinstance, the headset 115 may include one or more “extra-aural” speakersthat may be arranged on the headset 115 (e.g., on the right earcup 118)to project (or output) sound directly into the environment. This is incontrast to the speakers of in-ear, on-ear, or over-ear headphones, suchas speakers 225 and 230 that produce sound directly into a respectiveear of the user 100 (and may leak sound into the environment). In oneaspect, headset 115 may include an array of (two or more) extra-auralspeakers that are configured to project directional beam patterns ofsound at locations within the environment. In some aspects, thecontroller 215 may include a sound output beamformer that is configuredto produce speaker driver signals which when used to drive the two ormore extra-aural speakers, may produce spatially selective sound output.

FIG. 3 is a flowchart of one aspect of a process 300 to route acommunication to at least one of a bone conduction transducer or aspeaker. In one aspect, the process 300 may be performed by thecontroller 215 that is operating in the headset 115, as described inFIG. 2. In FIG. 3, process 300 begins by obtaining a communication (atblock 305). As described herein, a communication may be one of anelectronic message, such as a SMS message, a virtual assistant responseor alert, and/or an audio signal, such as an incoming phone call(downlink) signal. In one aspect, the communication may be a controlsignal indicating that the headset 115 is receiving the incoming call.In one aspect, the communication may be obtained via a network fromanother electronic device. In one aspect, the communication may be anindication or notification that output of an audio signal is desired.For instance, the controller 215 may receive an indication (e.g., arequest) from an application running locally on the headset 115 tooutput audio, such as an audible reminder from a calendar application.In another aspect, the indication may be associated with an audio signalcontaining audio content to alert the user of the headset that acommunication has been (or is being) received. The process 300determines whether the communication is private (at decision block 310).Specifically, the controller 215 may make this determination based on atleast one of several factors. For example, the controller 215 maydesignate the communication as private based on the origin of thecommunication, such as a telephone number of an incoming call asdescribed in FIG. 1. As another example, the controller may analyze thecontent of the communication to identify indications of privacy, such aspredefined words in a SMS message or the audio signal that is associated(or obtained along) with the communication. When the communication is anaudio signal, the controller may determine whether the audio signal isto be private based on identified metadata associated with the audiosignal that indicates whether or not the audio signal should be private.More about how the controller 215 makes this determination is describedin FIGS. 4-5.

In response to determining that the communication is private, theprocess 300 drives the BCT 135 of the headset 115 with an audio signalassociated with the communication (at block 315). For example, when thecommunication includes an audio signal (or is an audio signal)containing audio data (e.g., a voicemail, a downlink audio signal of anincoming call, user-desired content, etc.), the controller routes theaudio signal to the BCT 135 based on (e.g., an analysis of) the audiodata. If, however, the communication includes text, the controller 215may convert the text to a speech signal (e.g., using the TTS algorithmdescribed herein), and route the speech signal to the BCT. In response,however, to determining that the communication is not private, theprocess 300 drives at least one speaker 225 and/or 230 with the audiosignal associated with the communication (at block 320). In one aspect,the controller may drive the BCT with a different signal than one usedto drive either (or both of) the speakers in the earcups of the headsetfor a communication. For instance, since each speaker may have a fullerspectral range than the BCT, an audio signal used to drive each speakermay include more spectral content than a signal used to drive the BCT.In another aspect, the different signals may include different (orsimilar) portions of a communication.

FIG. 4 is a flowchart of one aspect of a process 400 to determinewhether a communication is private or non-private. In one aspect, thisprocess 400 is performed by the controller 215 of FIG. 2. The process400 begins by obtaining a communication (at block 405). For instance,the communication may have been obtained by the headset 115, via thenetwork interface 240 over a computer network, from another electronicdevice for example. The process 400 determines if the user of theheadset has requested (or instructed) that the obtained communication beprivate (at decision block 410). Specifically, the headset may alert theuser that the communication has been obtained in order to determine howthe user may want to route the communication. Specifically, uponobtaining the communication, the headset may present an alert message tothe user. For example, the headset may drive at least one speaker 225and 230 to output an audible alert, for example in the case of a textmessage, the controller 215 may output speech through the speakersindicating “Message received from Dr. Smith”. In one aspect, the headsetmay present a visual alert through the display 235.

In one aspect, the user of the headset 115 may instruct the controller215 to route the communication to the BCT 135 through a user command.For example, the user may issue a verbal command that is captured by amicrophone 120, and recognized as a command through the speechrecognition algorithm, described herein. As another example, the usermay select a physical button on the headset 115, or a UI item (e.g., anitem which when pressed routes the communication to the BCT 135) on atouchscreen display (e.g., either the display 235 or an external displaythat is accessible to the user) of the headset 115 or of a device pairedwith the headset, such as a smartphone.

In another aspect, the controller 215 may monitor for physical gesturesof the user that are indicative of an instruction to route thecommunication to the BCT 135. For example, the controller 215 maymonitor sensor data from sensors integrated into the headset 115 and/orsensors that are integrated into electronic devices that areelectrically coupled to the headset 115. For instance, the sensors mayinclude proximity sensors, motion sensors, microphones, etc.). As anexample, a proximity sensor may be positioned next to the user's ear.Upon determining that an object is within a threshold distance of thesensor according to the proximity sensor data, the controller maydetermine that the user is covering the ear. Such a gesture may beindicative of a desire to route the communication to the BCT. In oneaspect, the controller may identify the instruction when the proximitysensor data from the proximity sensor indicates that an object ispositioned over the ear, and remains there for a period of time (e.g.,five seconds).

If the user requests that the communication be private, the process 400routes the communication through a private channel (at block 440).Specifically, the controller 215 may drive the BCT 135 with an audiosignal associated with the communication, as described herein. In oneaspect, the audio signal may be a part of the communication. In anotheraspect, the audio signal may be generated (or produced) from thecommunication from another audio signal (or audio data) contained withinthe communication. As another example, the audio signal may be a speechsignal produced using a TTS algorithm upon text included within thecommunication, as described in FIG. 2.

In another aspect, the private channel may be (or include) a displaychannel. Specifically, the communication may be presented on the displayscreen 235 of the headset 115, as text, image, and/or video data of thecommunication. If, however, the communication includes an audio signal,the controller 215 may perform a speech-to-text algorithm to convert thesignal to words in order to present the words on the display screen. Insome aspects, the controller may output the communication to both theBCT and the display screen. In another aspect, the controller 215 mayoutput different portions of a communication through the BCT and thedisplay screen (e.g., in the case of an audio/video communication, theaudio may be outputted through the BCT 135 and the video may bepresented by the display screen 235).

If, however, the user of the headset 115 does not request that thecommunication be deemed private the process 400 determines whethercommunications from a sender of the communication are deemed (or definedpreviously) as private (at decision block 415). In one aspect, thecontroller 215 may wait for instructions from the user of the headsetfor a threshold of time (e.g., five seconds) after alerting the user ofthe obtained communication. If no instructions are received, however,the controller 215 may proceed to decision block 415. Specifically, asdescribed in FIG. 1, the controller may base the determination atdecision block 415 according to user settings stored within the headset(and/or the computer system in which the headset is a part of). Forexample, the settings may be a part of entries of a contacts list storedin memory of the headset 115. Communications may be associated with thesender through identifying information (e.g., metadata) that iscontained therein. For instance, the metadata may indicate the sender'sname, telephone number, user account name, email address, etc. Similarinformation may be stored within the contacts list that is used toidentify known senders. The controller compares the metadata of theobtained communication with identifying information of the entries inthe contacts list to identify a matching entry. Once the matching entryis found, the controller determines the sender of the communication. Thecontroller 215 determines whether the communication is private based onthe user defined settings associated with the sender's matching entry inthe list that indicate the communication is private. In one aspect, theuser defined settings may indicate that all types of communications(e.g., telephone calls, electronic messages, etc.) from this sender aredeemed private. In another aspect, the settings may differentiatebetween the types of communications. For example, the settings mayindicate that telephone calls from a sender are private, whileelectronic messages from the same sender are non-private.

In one aspect, the settings may be user defined or learned over time.For example, a user of the headset 115 may define a private settingthrough a graphical user interface (GUI) on a display of the headset (oran electronic device paired with the headset 115, such as a smartphone).The user may navigate to an entry in the contacts list associated withthe sender and interact with a user interface (UI) switch in which theuser may toggle between “private” and “non-private”. In another aspect,the settings may be defined through machine learning algorithms thatmonitor user instructions when communications are received from thisparticular sender. For instance, if the user continuously (e.g., morethan once) provides instructions to route communications from thissender to the BCT 135, this may be an indication that the user of theheadset wants future communications from this sender to be routed in asimilar fashion. As a result, the headset 115 may automatically (e.g.,without user intervention) adjust user settings to indicate thatcommunications form this sender art to be deemed private.

If communications from this sender are deemed private, the process 400proceeds to block 440 to route the communication through the privatechannel. If, however, communications from this sender are not deemedprivate, the process 400 determines if the sender has requested thiscommunication be private (at decision block 420). Specifically, thecontroller 215 may analyze the metadata contained within thecommunication to determine is the sender has tagged (or included arequest that) the communication be designated as private. In one aspect,the sender may have tagged the communication before sending it to theheadset 115 (e.g., through a selection of a UI item). If thecommunication includes such a tag or request, the process 400 proceedsto block 440.

If, however, the sender has not requested the communication be private,the process 400 performs content analysis upon the communication (atblock 425). For example, the controller 215 may analyze the (e.g., text,audio data, etc. of the) communication according to a speech recognitionalgorithm, as described herein, to identify a word or phrase containedtherein. As another example, when the communication includes image dataor video data, the controller 215 may perform object recognition, asdescribed herein, to identify an object contained therein. As anotherexample, when the communication is an audio signal, the controller 215may analyze the content of the audio signal and/or any metadataassociated with the audio signal. The process 400 determines if thecommunication is private based on the content analysis (at decisionblock 430). For instance, the controller 215 may determine if identifiedwords and/or phrases are defined as private. Specifically, thecontroller 215 may perform a table lookup into a data structure thatincludes words and phrases that are deemed private. If a matching wordor phrase is found within the data structure, the process 400 mayproceed to block 440. As an example, the communication may be a textmessage that starts with “I have a secret to tell you,” where the term“secret” indicates that the rest (or whole) communication is meant to beprivate. In one aspect, the sender may include an explicit indicationthat the communication is private, such as writing in the beginning of atext message “This is private.” In one aspect, a similar table lookupmay be performed with respect to identified objects. As another example,when the communication is an audio signal, metadata identified to beassociated with the audio signal may indicate the audio signal isprivate, or may indicate that the audio signal includes inappropriatecontent. If the controller 215 determines that the audio signal is to bedesignated as private according to the metadata, the process proceeds toblock 440.

Otherwise, if the communication is not private based on the contentanalysis, the process 400 routes the communication through a non-privatechannel by driving one or more of the speakers 225 and/or 230 with theaudio signal associated with the communication (at block 435).

Some aspects perform variations of the process 400. For example, thespecific operations of the process 400 may not be performed in the exactorder shown and described. The specific operations may not be performedin one continuous series of operations and different specific operationsmay be performed in different aspects. For example, in one aspect, theprocess 400 may omit the operations described in block 410. As a result,the controller 215 may perform these operations automatically (e.g.,without user intervention). In another aspect, if at any operationsdescribed herein it is determined that the communication is deemed to benon-private, the process 400 may proceed to block 435. For instance, ifcommunications from this sender are deemed to be non-private based onuser settings at decision block 415 and/or the sender has requested thecommunication to be non-private at block 420, the process 400 wouldroute the communication to one or more of the speakers of the headset115.

In some aspects, the process 400 may be performed to determine whether anotification of a communication is to be deemed private or non-private,rather than the communication itself. For instance, the process 400 maybe performed with respect to whether to route a ring tone of an incomingphone call through the private or non-private channel. Once routed, theuser may than accept the call (or message), and in response, the headset115 may route the communication through the same channel as the channelused to route the notification.

In some aspects, the controller 215 may route audio content (e.g.,music, etc.) other than communications through the BCT 135. Some audiocontent may be known to be better suited to be routed through theprivate channel. An example of such audio content may be a translationof text that the user of the headset 115 wants translated from alanguage that the user does not know to a familiar language. Forinstance, a translation application executing on the headset may producea speech signal from text, where the speech signal is in a differentlanguage from the text. The controller 215 may than obtain the speechsignal from the translation application and route the speech signal tothe private channel. In one aspect, the determination of whether toroute such audio to the private or non-private channel is according tothe application that provided the audio. In one aspect, the controller215 may determine that certain audio content be routed to the BCT 135through machine learning, as described herein.

In one aspect, the headset 115 may reroute an audio signal that iscurrently being outputted through either the speakers of the headsetand/or the BCT 135 upon determining that an obtained communication isprivate (or non-private). For example, the headset may determine that anaudio signal associated with a communication is to be routed through thenon-private channel, and thus drives at least one of the speakers of theheadset with the audio signal. While outputting the audio signal, thecontroller 215 may determine that the user of the headset is performinga physical gesture in order to reroute the audio signal through theprivate channel. For instance, the controller 215 may identify that theuser has placed at least one hand over the user's right ear based onimage data captured by the camera 205. As a result, the controller 215may cease driving the speaker(s) of the headset, and begin driving theBCT 135 with the audio signal. In one aspect, once the user performs theopposite gesture (e.g., removing the hand from the ear), the controller215 may revert the audio signal back through the speaker(s).

FIG. 5 is a flowchart of one aspect of a process 500 to route an audiosignal to either a BCT 135 or at least one speaker of the headset (e.g.,left speaker 225 and/or right speaker 230) based on identified objectswithin an environment in which the headset is located. In one aspect,the process 500 is performed by the (e.g., controller 215 of the)headset 115, as described in FIG. 2. The process 500 begins by obtainingimage data from the camera 205 that represents a scene within a field ofview of the camera (at block 505). The process 500 performs the objectrecognition algorithm, described herein, upon the image data to identifyan object contained within scene (at block 510).

The process 500 obtains an indication that output of an audio signal isdesired (at block 515). In one aspect, as described herein, theindication may be obtained from an application that is being executed bya processor of the headset. For instance, the indication may be obtainedfrom a media playback application, when output of an audio signalcontaining audio content, such as music, is desired for playback. Suchdesire may be based on user input. In another aspect, the indication mayindicate that a communication has been obtained, as described herein.The process 500 determines if the audio signal should be designated asprivate based on the identified object (at decision block 520). In oneaspect, the controller 215 performs this operation in response toobtaining the indication. The controller 215 determines if theidentified object is a known object that indicates audio signals shouldbe designated as private, and therefore be used to drive the BCT 135 inorder to output audio content contained therein. Such objects mayindicate that the user of the headset 115 is not in (or is approaching)a public area, where playing back audio through speakers is notpreferable. For example, these objects may include signs that indicatethe user of the headset is entering a quiet location, or a location thataudio/video recorded is taking place. The identified object may be alocation that is identified as not being private, such as a library or amovie theater.

In one aspect, the audio signals may be designated as private upondetermining that there are people within the field of view of thecamera. For example, the identified object may be another person, orspecifically the face or facial features of the person, such as eyes.Identifying another person's eyes within the field of view of the cameramay represent that the person is looking at the user of the headset 115because the person can hear the sound being produced by the speaker(s)of the headset. As a result, the audio signals may be designatedprivate. As another example, the audio signals may be designated asprivate when the identified object is a physical gesture that indicatesthe audio signal should private. An example of a physical gestureincludes identifying that a person is putting an index finger verticallyacross the person's lips. In one aspect, the physical gestures may bethe same or similar to the physical gestures described in FIG. 4. Upondetermining that the audio signal should be designated as private basedon the identified object, the process 500 drives the BCT 135 with theaudio signal (at block 525). Otherwise, the controller 215 drives thespeaker(s) of the headset 115 with the audio signal (at block 530).

In one aspect, the determination at decision block 520 may be based onadditional characteristics of the identified objects. For example, thedetermination may be based on whether the identified object is withinthe field of view of the camera 205 for a threshold period of time(e.g., five seconds). As an example, in the case in which the object isanother person's eyes, the controller 215 may designate the audio signalas private, when the other person's eyes are within the field of view ofthe camera 205 for at least the threshold time, which indicates that theperson is staring at the user. In another aspect, the determination maybe based on a distance between the user of the headset 115 and theidentified object. Specifically, when the identified object is a person,the audio signal may be designated as private when the person is withina threshold distance from the headset. This distance may be determinedthrough the use of computer vision triangulation in which the positionof the identified object is determined, given its projections onto twoor more digital images captured by the camera 205, relative to theheadset's known location and/or orientation when the images werecaptured. In one aspect, the user of the headset 115 may manually enterthe location of the identified object (e.g., through a user interface ofthe headset). In other aspects, the controller 215 may determine thedistance of the identified object through any known methods. In oneaspect, the threshold distance may be adjustable and based on the outputsound level of the speakers of the headset. For example, if the soundlevel is high, the distance may increase, while if the sound level islow, the distance may decrease. In one aspect, the threshold distancemay be predefined and determined in a controlled environment.

In one aspect, the determination of whether to route the audio signal tothe BCT or the speaker may be based on physical gestures identifiedwithin image data captured by the camera. Specifically, the controller215 may identify a specific gesture performed by the user according toobject recognition. For instance, image data captured by the camera 205may include hand gestures of the user of the headset 115, such as aplacing of the user's hand over the user's ear. Upon identifying thehand gesture, the controller 215 may drive the BCT with the audiosignal.

In another aspect, the determination to route (or reroute) an audiosignal to the BCT 135 may be based on ambient sound within theenvironment in which the headset 115 is located. The controller 215 mayobtain microphone signals from the microphones 210 that capture ambientsound. The controller may compare the sound level of the ambient soundto a threshold value. If the sound level of the ambient sound is above athreshold value, meaning that the environment is very noisy, thecontroller 215 may route the audio signal to the BCT 135 in order forthe user to hear the audio signal. Otherwise, if outputted through atleast one of the speakers of the headset, the signal to noise ratio maybe degraded because of the noisy environment, thereby making the audiosignal indistinguishable from the ambient sound.

Some aspects perform variations of the process 500 described in FIG. 5.For example, specific operations of this process may not be performed inthe exact order shown and described, and/or specific operations may beomitted entirely. In one aspect, the controller 215 may drive the BCT135, so long as the identified object is within the field of view of thecamera 205. For instance, at a first instance in time, the controller215 may drive the BCT with an audio signal based on an identified object(e.g., a “Please No Talking” sign). At a second instance in time, afterthe first instance in time, additional image data that represents thescene may be obtained. Upon determining that the identified object is nolonger within the scene according a performance of the objectrecognition algorithm upon the additional image data, the process 500may cease driving the BCT 135 and drive at least one of the speakers ofthe headset with the audio signal to output audio content containedtherein as sound. Conversely, if an audio signal is being used to driveat least one of the speakers of the headset 115 at a first instance intime, and it is determined that the audio signal is to be designated asprivate at a second instance in time (e.g., in response to identifyingan object contained within image data, the controller 215 may ceasedriving the speaker and drive the BCT 135 using the audio signal.

In some aspects, audio content may be outputted through both the BCT 135and at least one of the speakers of the headset 115. For example, anaudio signal may be outputted through both at the same time in order forthe user to get a fully audio experience. In one aspect, different audiosignals of the same (or similar) audio content may be outputted throughboth. For instance, the controller 215 may analyze an audio signal andextract different portions to be outputted through either one of thespeakers or the BCT 135. In the case of an audio signal includingspeech, the controller 215 may drive the speakers 225 and 230 to outputambient content of the audio signal, and drive the BCT 135 to output thespeech. Thus, the speech remains private, while the ambient content maybe heard by others in a close proximity to the user.

In one aspect, the indication obtained at block 515 may be based on theobjected that is identified within the scene. Specifically, uponidentification of an object, such as a person, the headset may obtain anindication that output of an audio signal associated with the identifiedobject is desired. For example, the audio signal may include contentdescribing physical attributes and/or identification information of theperson. For instance, the audio signal may indicate the name of theperson, what clothes the person is wearing, etc. As another example,when the identified object is an inanimate object, such as a sign, theaudio signal may describe attributes of the object (e.g., in the case ofa sign, the audio signal may contain speech describing words on thesign, the position of the sign with respect to the user, etc.). Audiosignal associated with describing such attributes and/or information maybe designated as private, since such information may be only importantto the user, and not anyone else around the user. As yet anotherexample, reminders, such as calendar reminders associated with anidentified object may be designated as private.

An aspect of the disclosure may be a non-transitory machine-readablemedium (such as microelectronic memory) having stored thereoninstructions, which program one or more data processing components(generically referred to here as a “processor”) to perform the networkoperations and signal processing operations (e.g., audio processingoperations and image processing operations). In other aspects, some ofthese operations might be performed by specific hardware components thatcontain hardwired logic. Those operations might alternatively beperformed by any combination of programmed data processing componentsand fixed hardwired circuit components.

While certain aspects have been described and shown in the accompanyingdrawings, it is to be understood that such aspects are merelyillustrative of and not restrictive on the broad disclosure, and thatthe disclosure is not limited to the specific constructions andarrangements shown and described, since various other modifications mayoccur to those of ordinary skill in the art. The description is thus tobe regarded as illustrative instead of limiting.

Personal information that is to be used should follow practices andprivacy policies that are normally recognized as meeting (and/orexceeding) governmental and/or industry requirements to maintain privacyof users. For instance, any information should be managed so as toreduce risks of unauthorized or unintentional access or use, and theusers should be informed clearly of the nature of any authorized use.

In some aspects, this disclosure may include the language, for example,“at least one of [element A] and [element B].” This language may referto one or more of the elements. For example, “at least one of A and B”may refer to “A,” “B,” or “A and B.” Specifically, “at least one of Aand B” may refer to “at least one of A and at least one of B,” or “atleast of either A or B.” In some aspects, this disclosure may includethe language, for example, “[element A], [element B], and/or [elementC].” This language may refer to either of the elements or anycombination thereof. For instance, “A, B, and/or C” may refer to “A,”“B,” “C,” “A and B,” “A and C,” “B and C,” or “A, B, and C.”

What is claimed is:
 1. A method comprising: obtaining image datacaptured by a camera of an electronic device that represents a scenewithin a field of view of the camera; performing an object recognitionalgorithm upon the image data to identify an object within the scene;obtaining an indication that output of an audio signal is desired; inresponse to obtaining the indication, determining whether the audiosignal is to be designated as private according to the object; anddriving a bone conduction transducer (BCT) of the electronic device withthe audio signal when the audio signal is designated as private.
 2. Themethod of claim 1 further comprising driving, at a first instance intime, a speaker of the electronic device with the audio signal, whereindetermining whether the audio signal is to be designated as private isat a second instance in time, after the first instance in time, whereindriving the BCT with the audio signal comprises ceasing to drive thespeaker with the audio signal.
 3. The method of claim 1, wherein drivingthe BCT with the audio signal is at a first instance in time, whereinthe method further comprises determining, at a second instance in time,after the first instance in time that the object is no longer within thescene according to the object recognition algorithm; and in response todetermining that the object is no longer within the scene, ceasing todrive the BCT, and beginning to drive a speaker of the electronic devicewith the audio signal.
 4. The method of claim 1, wherein the objectcomprises a person within the scene, wherein determining whether theaudio signal is to be designated as private comprises determining thatthe person is within a threshold distance from the electronic device. 5.The method of claim 4, wherein audio signal comprises audio contentdescribing at least one of physical attributes and identificationinformation of the person.
 6. The method of claim 4, wherein determiningwhether the audio signal is to be designated as private furthercomprises determining that the person in the scene is within the fieldof view of the camera for a threshold period of time.
 7. The method ofclaim 1, wherein the audio signal is a first audio signal, wherein themethod further comprises receiving a piece of sound program content;extracting a first portion of the piece of sound program content as thefirst audio signal and a second portion of the piece of sound programcontent as a second audio signal; and driving a speaker of theelectronic device with the second audio signal while driving the BCTwith the first audio signal.
 8. The method of claim 7, wherein the firstaudio signal includes speech and the second audio signal includesambient sound content of the piece of sound program content.
 9. Aheadset comprising: a camera; a bone conduction transducer (BCT); aprocessor; and memory having stored therein instructions which whenexecuted by the processor cause the headset to: obtain image datacaptured by the camera that represents a scene within a field of view ofthe camera; perform an object recognition algorithm upon the image datato identify an object within the scene; obtain an indication that outputof an audio signal is desired; in response to obtaining the indication,determine whether the audio signal is to be designated as privateaccording to the object; and drive the BCT with the audio signal whenthe audio signal is designated as private.
 10. The electronic device ofclaim 9 further comprises a speaker, wherein the instructions furthercomprise instructions to drive, at a first instance in time, the speakerwith the audio signal, wherein the instructions to determine whether theaudio signal is to be designated as private is at a second instance intime, after the first instance in time, wherein the instructions todrive the BCT with the audio signal comprises instructions to ceasedriving the speaker with the audio signal.
 11. The electronic device ofclaim 9 further comprises a speaker, wherein the instructions to drivethe BCT with the audio signal is at a first instance in time, whereinthe memory has further instructions to: determine, at a second instancein time, after the first instance in time that the object is no longerwithin the scene according to the object recognition algorithm; and inresponse to determining that the object is no longer within the scene,cease to drive the BCT; and begin to drive the speaker with the audiosignal.
 12. The electronic device of claim 9, wherein the objectcomprises a person within the scene, wherein the instructions todetermine whether the audio signal is to be designated as privatecomprises instructions to determine that the person is within athreshold distance from the electronic device.
 13. The electronic deviceof claim 12, wherein the audio signal comprises audio content describingat least one of physical attributes and identification information ofthe person.
 14. The electronic device of claim 12, wherein theinstructions to determine whether the audio signal is to be designatedas private further comprises instructions to determine that the personin the scene is within the field of view of the camera for a thresholdperiod of time.
 15. A non-transitory machine-readable medium havinginstructions which when executed by a processor of an electronic deviceobtain image data captured by a camera that represents a scene within afield of view of the camera; perform an object recognition algorithmupon the image data to identify an object within the scene; obtain anindication that output of an audio signal is desired; in response toobtaining the indication, determine whether the audio signal is to bedesignated as private according to the object; and drive a boneconduction transducer (BCT) of the electronic device with the audiosignal when the audio signal is designated as private.
 16. Thenon-transitory machine-readable medium of claim 15 further comprisesinstructions to drive, at a first instance in time, a speaker of theelectronic device with the audio signal, wherein the instructions todetermine whether the audio signal is to be designated as private is ata second instance in time, after the first instance in time, wherein theinstructions to drive the BCT with the audio signal comprisesinstructions to cease driving the speaker with the audio signal.
 17. Thenon-transitory machine-readable medium of claim 15, wherein theinstructions to drive the BCT with the audio signal is at a firstinstance in time, wherein the medium has further instructions to:determine, at a second instance in time, after the first instance intime that the object is no longer within the scene according to theobject recognition algorithm; and in response to determining that theobject is no longer within the scene, cease to drive the BCT; and beginto drive a speaker of the electronic device with the audio signal. 18.The non-transitory machine-readable medium of claim 15, wherein theobject comprises a person within the scene, wherein the instructions todetermine whether the audio signal is to be designated as privatecomprises instructions to determine that the person is within athreshold distance from the electronic device.
 19. The non-transitorymachine-readable medium of claim 18, wherein the audio signal comprisesaudio content describing at least one of physical attributes andidentification information of the person.
 20. The non-transitorymachine-readable medium of claim 18, wherein the instructions todetermine whether the audio signal is to be designated as privatefurther comprises instructions to determine that the person in the sceneis within the field of view of the camera for a threshold period oftime.